Feature ranking for multi-label classification using predictive clustering trees
نویسنده
چکیده
In this work, we present a feature ranking method for multilabel data. The method is motivated by the the practically relevant multilabel applications, such as semantic annotation of images and videos, functional genomics, music and text categorization etc. We propose a feature ranking method based on random forests. Considering the success of the feature ranking using random forest in the tasks of classification and regression, we extend this method for multi-label classification. We use predictive clustering trees for multi-label classification as base predictive models for the random forest ensemble. We evaluate the proposed method on benchmark datasets for multi-label classification. The evaluation of the proposed method shows that it produces valid feature rankings and that can be successfully used for performing dimensionality reduction.
منابع مشابه
MLIFT: Enhancing Multi-label Classifier with Ensemble Feature Selection
Multi-label classification has gained significant attention during recent years, due to the increasing number of modern applications associated with multi-label data. Despite its short life, different approaches have been presented to solve the task of multi-label classification. LIFT is a multi-label classifier which utilizes a new strategy to multi-label learning by leveraging label-specific ...
متن کاملDetection of Visual Concepts and Annotation of Images Using Ensembles of Trees for Hierarchical Multi-Label Classification
In this paper, we present a hierarchical multi-label classification system for visual concepts detection and image annotation. Hierarchical multi-label classification (HMLC) is a variant of classification where an instance may belong to multiple classes at the same time and these classes/labels are organized in a hierarchy. The system is composed of two parts: feature extraction and classificat...
متن کاملEvaluation of Different Data-Derived Label Hierarchies in Multi-label Classification
Motivated by an increasing number of new applications, the research community is devoting an increasing amount of attention to the task of multi-label classification (MLC). Many different approaches to solving multi-label classification problems have been recently developed. Recent empirical studies have comprehensively evaluated many of these approaches on many datasets using different evaluat...
متن کاملOrganization Workshop Co-chairs Program Committee Additional Referees an Ensemble Method for Multi-label Classification Using a Transportation Model 49 Ignoring Co-occurring Sources in Learning from Multi-labeled Data Leads Evaluation of Distance Measures for Hierarchical Multi-label Classification in Functional Genomics
Hierarchical multi-label classification (HMLC) is a variant of classification where instances may belong to multiple classes that are organized in a hierarchy. The approach we used is based on decision trees and is set in the predictive clustering trees framework (PCTs), which is implemented in the CLUS system. In this work, we are investigating how different distance measures for hierarchies i...
متن کاملCOMPARISON OF DISTANCES FOR MULTI-LABEL CLASSIFICATION WITH PCTs
Multi-label classification has received significant attention in the research community over the past few years: this has resulted in the development of a variety of multi-label classification methods. These methods either transform the multi-label dataset to several simpler datasets or adapt the learning algorithm so it can handle the multiple labels. In this paper, we consider the latter appr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013